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Description 

SYSTEMS AND METHODS FOR 
ESTIMATING AN IMAGE MARKING 
PROCESS USING SCANNED IMAGE 
ATTRIBUTES 

BACKGROUND OF THE INVENTION 

1. FIELD OF INVENTION 

[0001] This invention relates to automatically identifying a marking process used to 
form an image on a substrate. 

2. DESCRIPTION OF RELATED ART 

[0002] In order to accurately calibrate a scanner, such as, for example, a color 
scanner, that scans an image carried on a substrate, different calibration 
transformations are required depending on the marking process, such as, for 
example, photography, inkjet printing, xerography, lithography and the like, 
and materials, such as, for example, toner, pigment, ink, etc., that are used 
to form the image on the substrate. For example, a calibration transformation 
that is used to calibrate the scanner for a photographic image is different 
from a calibration transformation that is used to calibrate the scanner for an 
ink jet-printed image, which is in turn different from a calibration 
transformation that is used to calibrate the scanner for a 
xerographically-formed image or for a lithographically-formed image. 



[0003] Typically, a user wishing to scan an image determines the marking process 
used to form the image from prior knowledge of the marking process, 
manually identifies the marking process such as, for example, photographic, 
ink jet, xerographic or lithographic, and uses the marking process 
information to set the scanner so that an appropriate calibration can be 
used. The manual identification is commonly done using different 
descriptions, such as Halftone vs. Photo vs. Xerographic Copy on the user 
interface from which different machine settings are inferred. 

[0004] Approaches to automatically identifying the marking process are disclosed in 
U.S. Patent Nos. 6,353,675 and 6,031,618, each of which is incorporated 
herein by reference in its entirety. The approach to automatically identifying 
the marking process disclosed in the 618 patent uses additional spectral 
information from the scanned material obtained through additional spectral 
channels. The approach used to automatically identify the marking process 
disclosed in the 675 patent involves an image spatial analyzer that analyzes 
image data corresponding to the image to determine at least one spatial 
characteristic based on a power spectrum of the image data and a marking 
process detection system that detects the marking process based on the at 
least one spatial characteristic. 

SUMMARY OF THE INVENTION 

[0005] It would be desirable to perform analyses of the scanned image data directly 
from the scanned data, that is, without using any additional resources, to 
identify the marking process used to form that image. The inventors have 
determined that images carried on substrates exhibit unique spatial 



characteristics that depend upon the type of marking process used to form 
those images. 



[0006] This invention provides methods and systems that automatically identify a 
marking process based on spatial characteristics of the marked image. 

[0007] This invention separately provides systems and methods that automatically 
identify a marking process without the need to add one or more additional 
sensors. 

[0008] This invention separately provides systems and methods that automatically 
identify a marking process without the need to use any additional data 
beyond that obtainable from the marked image using the standard scanner 
sensors. 

[0009] This invention separately provides methods and systems that automatically 
differentiate between continuous tone and binary marking processes. Here, 
it is understood that binary marking processes can be obviously extended to 
marking processes locally using a small number of levels as it is done for 
example some in 7 or 8 head inkjet printing devices. The terms binary and 
halftone are used throughout this application to include those systems. 

[0010] This invention separately provides methods and systems that automatically 
differentiate between different types of binary image marking processes, 
including, for example, inkjet marking processes, xerographic marking 
processes, and lithographic marking processes. 



[001 1] In various exemplary embodiments of the systems and methods according to 
this invention, continuous tone and halftone process images are 



ed by examining local variations of the input data, including using local 
variants as an estimator for local variations of the input data. In various other 
exemplary embodiments of the systems and methods according to this 
invention, image spatial characteristics are identified by checking for halftone 
dot periodicity in the image. In various other exemplary embodiments of the 
systems and methods according to this invention, frequency, frequency 
relationships, and/or noise characteristics of scanned image data are 
employed to identify the image marking process. In various other exemplary 
embodiments of the systems and methods according to this invention, a 
determination whether or not the image has an underlying halftone rendition 
with a clustered or dispersed character may be performed. 

[0012] These and other features and advantages of this invention are described in, 
or are apparent from, the following detailed description of various exemplary 
embodiments of the systems and methods according to this invention. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0013] Various exemplary embodiments of the systems and methods of this 

invention will be described in detail, with reference to the following figures, 
wherein: 

[0014] Fig. 1 shows one exemplary embodiment of a decision tree for a media 
identification process according to the invention; 

[0015] Fig. 2 shows enlarged views of scanned regions of an image formed using 
different image formation processes; 

[001 6] Fig. 3 shows one exemplary embodiment of a decision tree for a media 



ification process illustrating a correlation between input media type and 
measurable spatial image attributes using statistical differentiators; 



[001 7] Fig. 4 is a flowchart outlining one exemplary embodiment of a method for 

determining the image marking process used to produce an image according 
to this invention; 

[0018] Fig. 5 is a flowchart outlining in greater detail one exemplary embodiment of 
the method for generating data statistics of Fig. 4; 

[0019] Figs. 6 and 7 is a flowchart outlining in greater detail one exemplary 

embodiment of the method for determining the process used to produce a 
given data block of Fig. 4; 

[0020] Fig. 8 illustrates one exemplary embodiment of a histogram of inter-minority 
distance; and 

[0021] Fig. 9 is a functional block diagram of one exemplary embodiment of a 
system used to identify media/image marking process according to this 
invention. 

DETAILED DESCRIPTION OF EXEMPLARY EMBODIMENTS 

[0022] The inventors have determined that there is a strong correlation between the 
input media type and a number of measurable spatial image attributes 
obtainable directly from the scanned image data itself. Because there is a 
strong correlation between the input media type and these measurable 
spatial image attributes, the marking process used to form the scanned 
original can be ascertained, with a relatively high degree of confidence, from 



the statistical spatial properties of the scanned image data. 

[0023] Typically, photographic printing, as well as any other analog image printing 
process, is a continuous tone, or "contone", marking process. Binary 
printing, however, typically involves a halftone process. Inkjet printing, for 
example, primarily or typically uses error diffusion/stochastic screens, while 
xerography, including color xerography, primarily or typically uses 
line-screens and/or clustered dot screens, and lithography primarily or 
typically uses clustered-dot rotated halftone screens. It should be 
appreciated that any of these binary marking techniques could have one of 
these halftone processes. However, the choices outlined above are 
predominant in typical usage, because of image quality and stability 
considerations. 

[0024] Black and white images have variations in lightness and darkness. Color 
images have variations in color. Whereas variations in continuous tone 
images arise from variations in image data, halftone images have variations 
both from the image data and from the halftone reproduction process itself. 
Variations arising from the image data typically occur over much larger 
scales than the variations occur in halftone processes. Therefore, over a 
small scale, continuous tone images, such as photographic images, typically 
have a much smaller variation than do halftone images. Based on this, 
various exemplary embodiments of the systems and methods according to 
this invention look at local variations within the scanned image data to 
identify which marking process was used to render the image. That is, 
various exemplary embodiments of the systems and methods according to 
this invention look at local variations within the scanned image data to 



determine whether a continuous tone or photographic image marking 
process was used, or whether a halftone marking process was used. That is, 
in various exemplary embodiments of the systems and methods according to 
this invention, continuous tone image marking processes are differentiated 
from halftone image marking processes by examining local variations of the 
marked image input data. 

[0025] Fig. 1 illustrates one exemplary embodiment of a decision tree 1 00 usable to 
perform image marking process/media identification according to the 
invention. In the decision tree 100 shown in Fig. 1, all image data 105 is 
evaluated. The first decision point 110 differentiates between a continuous 
tone image marking process 120 and a halftone image marking process 125 
in a scanned image by examining local variations of the scanned image 
input data to determine whether there is low local/spatial variation 1 15 in the 
scanned image data or high local/spatial variation 1 16 in the scanned image 
data. 

[0026] This distinction coincides with the distinction between a photograph or other 
analog image marking process and a binary image marking process. That is, 
determining continuous tone image data would imply that the image marking 
process for the scanned image data is a photo process, i.e., that the image 
is a photo 121 . 

[0027] Detecting a halftone marking process 125 would imply that the image 

marking process for the scanned image data is an ink-jet marking process 
140, a xerographic marking process 145, an offset marking process 146, or 
the like. 



[0028] In the exemplary embodiment of the decision tree 100 shown in Fig. 1 , the 
next decision point 130 differentiates between the various halftone image 
marking processes 140, 145 and 146 by examining the spatial 
characteristics of the scanned image data to determine whether the data has 
a dispersed/aperiodic character 135 or a clustered/periodic character 136. 

[0029] Detecting data having a dispersed/aperiodic character would imply that the 
image marking process for the scanned image data is an ink-jet marking 
process 140, i.e., that the image is an ink-jet image 141. On the other hand, 
detecting data having a clustered/periodic character would imply that the 
image marking process for the scanned image data is a xerographic marking 
process 145, an offset marking process 146, or the like. 

[0030] In the exemplary embodiment of the decision tree 100 shown in Fig. 1 , the 
next decision point 150 differentiates between a xerographic marking 
process 160 and an offset marking process 165 by examining the data 
frequency distribution or internal structure of the scanned image data. Image 
data internal structure examples that may be considered include determining 
whether the image data has a line structure as contrasted with a rotated 
structure, whether the halftone dots have a high frequency structure versus 
a low frequency structure, and whether the halftone screen noise is high or 
low. 

[0031] Detecting image data having a low frequency/high noise character 155 

would imply that the image marking process for the scanned image data is a 
xerographic marking process 1 60 that was used to create a xerographic 
image 1 61 . On the other hand, detecting image data having a high 



frequency/low noise character 156 would imply that the image marking 
process for the scanned image data is an offset, or lithographic, marking 
process 165 that was used to generate an offset printed/ lithographic image 
166. 

[0032] The decision tree of Fig. 1 is not intended to imply that data can not be 

reevaluated. In some cases, for example, data identified as ink-jet 141 might 
still be evaluated with respect to the data frequency distribution 150 and the 
result of this being used to verify, harden or reexamine the identification of 
the marking process of the image as an ink-jet marking process 140. The 
additional burden with respect to speed, processing time, etc. for verification 
is system dependent and might be negligible, in which case reexamination is 
advantageous. In other cases, a strict structure like the one shown in Fig. 1 
is advisable. In addition, as will be appreciated by those skilled in the art, the 
decision process can be applied to the entire image as a single 
determination or can be applied individually or to parts of the image. These 
independent image portions may be determined by segmenting the image 
through an independent process. Furthermore, the decision process may be 
independently applied to small regions of the image and the results from 
these regions may then be pooled or combined to determine an image 
marking process. This pooling or combination can further use a measure of 
confidence for each region when determining the overall marking process. 

[0033] Fig. 2 shows in detail a number of scanned regions of a photograph 210, an 
inkjet marked image region 220, a lithographically-formed image region 230 
and a xerographically-formed image region 240, scanned, for example, at 
600 dots per inch (dpi). As shown in Fig. 2, the continuous or photographic 



image region 21 0 has a much smaller variation in the number of adjacent 
light and dark areas throughout the scanned region than do the halftone-type 
image regions 220-240. Additionally, as shown in Fig. 2, the halftone dots of 
the inkjet image region 220 have an aperiodic dispersed nature, while the 
halftone dots of the lithographically-formed image region 230 and the 
xerographically-formed image region 240 have strong periodic structures. 
Finally, as shown in Fig. 2, the lithographically-formed image region 230 has 
a higher spatial frequency of halftone dots and lower noise than does the 
xerographically-formed image region 240. 

[0034] Fig. 3 is a decision tree illustrating the correlation of the scanned image data 
with the input media determination process of Fig. 1 using statistical 
differentiators at each decision point 310, 320 and 330. In the exemplary 
embodiment shown in Fig. 3, just as in the exemplary embodiment shown in 
Fig. 1 , the first decision block 310 differentiates between analog tone and 
binary image marking processes. As shown in Fig. 3, this is achieved by 
examining the local variations of the input data. An image formed by a binary 
image marking process typically shows a relatively high level of local 
variation compared to an image formed using an analog image marking 
process, such as a continuous tone image marking process, such as, for 
example, a photographic, image marking process. Accordingly, local feature 
variants may be used as an estimator for the local variation at this stage. As 
a result of the analysis in the first decision block 310, images created using 
an analog or continuous tone image marking process 315, such as, for 
example a photo image marking process 315, are separated from images 
created using other image marking processes. 



[0035] The second decision block 320 of Fig. 3 differentiates between an inkjet 
image forming process 325 and other halftone image marking processes, 
such as, for example, a xerographic image marking process 335, an offset or 
lithographic image marking process 345, or the like. This is accomplished by 
examining various spatial characteristics of the scanned image data to 
determine whether the data has a dispersed/aperiodic character or a 
clustered/periodic character. In various exemplary embodiments, the second 
decision block 320 differentiates between an inkjet image marking process 
325, and a xerographic image marking process 335 or an offset image 
marking process 345 by evaluating the rendering uniformity and periodicity 
of the observed spatial variation of the halftone dots to discriminate between 
clustered and dispersed dot rendering methods. 

[0036] For example, inkjet-formed marking processes 325 use mainly distributed 
dot techniques, such as, for example, error diffusion, stochastic screening 
and/or blue noise screening. These processes commonly do not have a 
single fundamental periodicity across all gray levels. However, distributed 
dot techniques are extremely uncommon for xerographic image marking 
processes 335 or for lithographic or offset image marking processes 345. 
Xerographic image marking processes 335 and lithographic or offset image 
marking processes 345 typically use clustered halftone dot techniques that 
have a dot periodicity that is not a function of the input level. At the same 
time, distributed dot techniques have a higher uniformity than do clustered 
dot techniques. 

[0037] The third decision block 330 of Fig. 3 differentiates between xerographic 
image marking processes 335 and offset or lithographic image marking 



processes 345 by analyzing frequency and noise characteristics of the 
scanned data. In one exemplary embodiment, the third decision block 330 
differentiates between xerographic image marking processes 335 and offset 
or lithographic image marking processes 345 by evaluating the symmetry 
and frequency of the halftone dots. In general, line screens are common in 
xerographic image marking processes 335, but are uncommon in offset or 
lithographic image marking processes 345. Rotated dot schemes are also 
common in xerographic image marking processes. Based on these 
tendencies, the absolute frequency of the input screen, and its noise 
characteristics can be analyzed as part of the third decision block 330. In 
particular, high frequency, low noise screens may be associated with offset 
or lithographic image marking processes 345, while low frequency, high 
noise screens may be associated with xerographic image marking processes 
335. 

[0038] As noted above, in various exemplary embodiments of the systems and 
methods according to this invention, a group of pixels from a fairly small 
block or sub-region that may be considered to be roughly homogenous in 
terms of color or gray value can be examined. Since the image has no 
spatial variation over a homogeneous region, the spatial structure in the 
halftoned version of the image is entirely due to the halftoning technique. 
Such regions are therefore useful for analyzing the underlying halftone 
technique without interference from the image content. Often binarizing a 
related group of pixels in the block will reveal the spatial arrangements that 
take place in the image marking process, that is, halftone marking process or 
continuous tone marking process. Accordingly, in various exemplary 
embodiments of the systems and methods according to the invention, a 



block of a related group of image pixels is binarized to create a map that is 
indicative of image marking processes. 

[0039] Fig. 4 is a flowchart outlining one exemplary embodiment of a method for 
determining from scan image data of an image, the image marking process 
used to create an image according to this invention. As shown in Fig. 4, the 
method begins in step S1 000, and continues to step S1 1 00, where the 
scanned image data is divided into one or more data blocks, each having a 
determined number of pixels. In various exemplary embodiments of the 
methods and systems according to this invention, the scanned image data 
may be divided into data blocks or areas having any desired number of 
pixels. In one exemplary embodiment, the scanned image data may be 
divided into data blocks having 60x60 pixels for scanned images at 600 dpi. 
This division into blocks could be based on pure spatial considerations, e.g. 
location, but might also be influenced by additional information such as given 
by image segmenters and the like. 

[0040] Then, in step S1200, the one or more image data blocks are selected to be 
analyzed or processed. In various exemplary embodiments of the methods 
and systems according to this invention, to obtain low-noise data, data 
blocks or areas that represent constant or near constant image data are 
preferably selected in step S1200. This tends to exclude image edges, paper 
background, and the like. 

[0041] Next, in step S1300, each of the selected one or more image data blocks is 
processed to generate one or more data statistics for that image data block. 
In various exemplary embodiments of the methods and systems according to 



this invention, the one or more data statistics generated for the one or more 
image data blocks may include determining an average or mean value of the 
pixels for the image data block being processed, determining a variance 
value of the pixels for the image data block, determining the extremes, such 
as, for example, the minimum value, min a , and maximum value, max a , of the 
pixels for the image data block, generating histograms of the data being 
processed, and performing various data evaluations using the determined 
statistical values and histograms. To estimate if the input has significant and 
consistent periodicity, it is particularly beneficial to locate local minima along 
traversals through the image block, determine the distances between 
successive minima, and determine histograms of these inter-minima 
distances. A strong peak in a histogram of inter-minimum distances indicates 
that a large number of minima are separated by a constant period, thereby 
implying periodicity. Local maxima can similarly be used, and a decision 
between the use of minima and maxima may be made based on image level, 
for instance. Operation then continues to step S1400. 

[0042] In step S1400, the one or more data statistics generated for the one or more 
image data blocks are compared with image data statistics already 
determined and provided in an image data statistics model. Next, in step 
S1500, the results of comparing the one or more data statistics generated in 
step S1300 for the one or more image data blocks are used to determine the 
specific image marking process used to format the image. Operation then 
continues to step S1 600, where operation of the method stops. 

[0043] It should be appreciated that, in various exemplary embodiments, step 

S1400 can be omitted. In this case, operation of the method would proceed 



directly from step S1300 to step S1500. In general, step S1400 can be 
skipped. 

[0044] Fig. 5 is a flowchart outlining in greater detail one exemplary embodiment of 
the method for generating the data statistics of Fig. 4. As shown in Fig. 5, 
operation of the method begins in step S1300 and continues to step S1310, 
where statistical values or parameters are determined over the selected data 
block or pixel area. In various exemplary embodiments, any or all of a 
number of statistical values or parameters may be determined, such as, for 
example, an area average or mean <A> of the pixels for the image data 
block, an area variance # a of the pixels for the image data block, and the 
extreme minima and maxima values, min a and max a of the pixels for the 
image data block. The determined statistical values or parameters may be 
determined using well known spatial statistics methods or techniques. 

[0045] Then, in step S1320, various data evaluations are performed using the 

determined statistical values or parameters. In one exemplary embodiment 
of the methods and systems according to this invention, data evaluations 
may include determining a ratio of the area variance # a to mean <A> 
determined for a given block, determining the distribution of the mean values 
<A> for large pixel areas, comparing the determined mean value <A> to the 
determined min a and/or max a values, determining a distance between local 
maxima/minima, and the like. 

[0046] Next, in step S1330, histograms are generated using the results of the data 
evaluations performed in step S1320. Then, in step S1340, operation returns 
to step S1500. 



[0047] Figs. 6 and 7 illustrate a flowchart outlining in greater detail one exemplary 
embodiment of determining the image marking process of Fig. 4. As shown 
in Figs. 6 and 7, operation of the method begins in step S1500 and 
continues to step S1505, where local variations in image data are evaluated 
to distinguish between a continuous tone image marking process and other 
halftone marking processes. In various exemplary embodiments of the 
methods and systems according to this invention, in step S1505, area 
variance is used as an estimator for local variation in the image data. In 
various exemplary embodiments, the area variance to mean ratio is used to 
evaluate local variations in the image data. The area variance to mean ratio 
is directly used to distinguish halftone marking processes from a continuous 
tone marking process or background areas, as discussed below. 

[0048] Then, in step S1510, a determination is made whether the image data 
evaluated exhibits high local variation. As discussed above, a continuous 
tone image, for example, a scanned photographic image, exhibits a much 
smaller local variation than halftone images, such as, for example, an 
inkjet-formed image, a xerographically-formed image or a 
lithographically-formed image. If the image data does not exhibit high local 
variation, it is likely that the image marking process used to form the image 
is a continuous tone image marking process or the image data contains 
significant background noise. It should be noted that in any image marking 
process, some local areas might exhibit low variance, for example in image 
highlight and shadow regions, or in other solid color areas. Accordingly, if the 
image data does not exhibit high local variation, operation continues to step 
S1515. If image data exhibits high local variation, operation continues to 
step S1535. 



[0049] As shown in Fig. 7, in step S1515, a distribution of the mean value over 

large data blocks/areas is determined or analyzed to distinguish between a 
continuous tone image marking process and background noise. Next, in step 
S1520, a determination is made whether the distribution of the mean value is 
characteristic of a continuous tone marking process. If so, operation 
continues to step S1525. Otherwise, operation jumps to step S1530. In step 
S1525, the image marking process is identified as or determined to be a 
photographic image marking process. Operation then jumps to step S1570. 
In contrast, in step S1530, the image data is identified and/or classified as 
background noise. Operation then also jumps to step S1570. It should be 
appreciated, that, if the background data blocks were not suppressed, their 
classification as "photo"data blocks could swamp all rendering-derived 
image signatures. 

[0050] As shown in Fig. 6, in step S1535, the image data is evaluated for its 
periodicity characteristics. In various exemplary embodiments of the 
methods and systems according to this invention, in step S1535, the data 
block mean value is compared to the determined min a and max a values to 
distinguish the minority pixels in the distribution. The minority pixels are 
generally either light pixels on a dark background or dark pixels on a light 
background. This distinction is made as noise suppression, such that only 
minority pixels are analyzed further because the halftone characteristics are 
better identified by considering the distribution of the minority pixels. 

[0051] Next, in step S1540, a determination is made whether the evaluated image 
data has a clustered character with high periodicity. If image data does not 
have high periodicity, operation continues to step S1545. Otherwise, 



operation jumps to step S1550. In step S1545, the image marking process 
used to create the scanned image is determined to be an inkjet image 
marking process. As discussed above, inkjet-based marking processes use 
mainly distributed dot techniques, such as, for example, error diffusion, 
stochastic screening, frequency modulation, and/or blue noise screening, 
which do not have a single fundamental periodicity across all gray levels. 
Operation then jumps to step S1570. 

[0052] In contrast, in step S1550, the frequency and noise characteristics of the 
scanned image data are evaluated to further distinguish between a 
xerographic image marking process and an offset-marking process. In 
various exemplary embodiments of the methods and systems according to 
this invention, in step S1550, the absolute frequency of the input screen is 
determined and the noise characteristics of the screen are examined. In one 
exemplary embodiment, in step S1550, after the minority pixels are 
identified, the distance between maxima/minima corresponding to 
subsequent minority pixels is determined, excluding a small region around 
the mean to exclude noise. 

[0053] Next, it step S1555, a determination is made whether the scanned image 
data has a low frequency, high noise character. If so, operation continues to 
step S1560. Otherwise, operation jumps to step S1565. In step S1560, 
image marking process is determined to be, and/or is identified as, a 
xerographic image marking process. Operation then jumps to step S1570. In 
contrast, in step S1565, the image marking process is determined to be, 
and/or is identified as, an offset image marking process because high 
frequency, low noise screens are correlated with offset input. Operation then 



continues to step S1570, where the operation of the method returns to step 
S1600. 



[0054] Fig. 8 illustrates one exemplary embodiment of a histogram of the 

inter-maxima/minima distance between minority pixels for a single image 
area formed using an inkjet image marking process, a 
xerographically-formed image marking process and an offset image marking 
process, based on the results generated in step S1500 of Fig. 4. As shown 
in Fig. 8, different media types may be distinguished. For example, the inkjet 
image marking process curve 630 is clearly distinguishable, having a 
strongly different frequency characteristic with no clear periodicity. On the 
other hand, the offset image marking process curve 61 0 and the 
xerographically-formed image marking process curve 620 both show strong 
periodicity. 

[0055] Further, as shown in Fig. 8, the offset image marking process curve 610 and 
the xerographic image marking process curve 620 are further distinguishable 
by the higher frequency, i.e., closer spacing of the peaks, in the offset image 
marking process curve 610, shown as peaks to the left of xerographic image 
marking process curve 620 in Fig. 8. A secondary indicator identifying the 
xerographic image marking process curve 620 is the high amplitude of the 
high frequency sidelobe at a periodicity of 4-5 pixels. 

[0056] Fig. 9 illustrates a functional block diagram of one exemplary embodiment of 
the media/image marking process identification system 400 according to this 
invention. The media/image marking process identification system 400 may 
be a stand alone system or may be connected to a network (not shown) via 



the link 414. The link 414 can be any known or later developed device or 
system for connecting the media/image marking process identification 
system 400 to the network, including a connection over public switched 
telephone network, a direct cable connection, a connection over a wide area 
network, a local area network or a storage area network, a connection over 
an intranet or an extranet, a connection over the Internet, or a connection 
over any other distributed processing network or system. In general, the link 
414 can be any known or later-developed connection system or structure 
usable to connect the media/image marking process identification system 
400 to the network. 

[0057] As shown in Fig. 9, the media/image marking process identification system 
400 may include one or more display devices 470 usable to display 
information to one or more users, and one or more user input devices 475 
usable to allow one or more users to input data into the media/image 
marking process identification system 400. The one or more display devices 
470 and the one or more input devices 475 are connected to the 
media/image marking process identification system 400 through an 
input/output interface 410 via one or more communication links 471 and 476, 
respectively, which are similar to the communication link 414 above. 

[0058] In various exemplary embodiments, the media/image marking process 

identification system 400 includes one or more of a controller 420, a memory 
430, an image data local variation differentiation circuit, routine or application 
440, an image data spatial characteristics differentiation circuit, routine or 
application 450, an image data frequency distribution circuit, routine or 
application 460, an image data statistics generation circuit, routine or 



application 470, and a media/image marking process determination circuit, 
routine or application 480, which are interconnected over one or more data 
and/or control buses and/or application programming interfaces 492. The 
memory 430 includes one or more of a media/image marking process 
identification model 432. 

[0059] The controller 420 controls the operation of the other components of the 

media/image marking process identification system 400. The controller 420 
also controls the flow of data between components of the media/image 
marking process identification system 400 as needed. The memory 430 can 
store information coming into or going out of the media/image marking 
process identification system 400, may store any necessary programs and/or 
data implementing the functions of the media/image marking process 
identification system 400, and/or may store data and/or user-specific 
information at various stages of processing. 

[0060] The memory 430 includes any machine-readable medium and can be 
implemented using appropriate combination of alterable, volatile or 
non-volatile memory or non-alterable, or fixed, memory. The alterable 
memory, whether volatile or non-volatile, can be implemented using any one 
or more of static or dynamic RAM, a floppy disk and disk drive, a writable or 
re-rewriteable optical disk and disk drive, a hard drive, flash memory or the 
like. Similarly, the non-alterable or fixed memory can be implemented using 
any one or more of ROM, PROM, EPROM, EEPROM, an optical ROM disk, 
such as a CD-ROM or DVD-ROM disk, and disk drive or the like. 



[0061] In various exemplary embodiments, the media/image marking process 



identification model 432 which the media/image marking process 
identification system 400 employs to identify the media and/or image 
marking process used to process a particular medium is based on the image 
data analysis techniques discussed above to determine local variations of 
the input data, identify image data spatial characteristics, determine image 
data frequency distributions, and the like. 

[0062] With reference to Figs. 1 and 9, the image data local variation differentiation 
circuit, routine or application 440 is activated by the controller 420 to 
differentiate between a continuous tone image marking process 120 and a 
halftone image marking process 125 in a scanned image by examining local 
variations of the scanned image input data to determine whether there is low 
local/spatial variation 1 15 in the scanned image data or high local/spatial 
variation 1 1 6 in the scanned image data. 

[0063] This distinction coincides with the distinction between a photograph or other 
analog image marking process and a binary image marking process. That is, 
determining continuous tone image data would imply that the image marking 
process for the scanned image data is a photo process, i.e., that the image 
is a photo 121 . 

[0064] As discussed above, detecting a halftone marking process 125 would imply 
that the image marking process for the scanned image data is an ink-jet 
marking process 140, a xerographic marking process 145, an offset marking 
process 146, or the like. 

[0065] The image data spatial characteristics differentiation circuit, routine or 

application 450 is activated by the controller 420 to differentiate between the 



various halftone image marking processes 140, 145 and 146 by examining 
the spatial characteristics of the scanned image data to determine whether 
the data has a dispersed/aperiodic character 135 or a clustered/periodic 
character 136. 

[0066] Detecting data having a dispersed/aperiodic character would imply that the 
image marking process for the scanned image data is an ink-jet marking 
process 140, i.e., that the image is an ink-jet image 141. On the other hand, 
detecting data having a clustered/periodic character would imply that the 
image marking process for the scanned image data is a xerographic marking 
process 145, an offset marking process 146, or the like. 

[0067] The image data frequency distribution circuit, routine or application 460 is 
activated by the controller 420 to differentiates between a xerographic 
marking process 1 60 and an offset marking process 1 65 by examining the 
data frequency distribution or internal structure of the scanned image data. 
Image data internal structure examples that may be considered include 
determining whether the image data has a line structure as contrasted with a 
rotated structure, whether the halftone dots have a high frequency structure 
versus a low frequency structure, and whether the halftone screen noise is 
high or low. 

[0068] Detecting image data having a low frequency/high noise character 155 

would imply that the image marking process for the scanned image data is a 
xerographic marking process 1 60 that was used to create a xerographic 
image 1 61 . On the other hand, detecting image data having a high 
frequency/low noise character 156 would imply that the image marking 



process for the scanned image data is an offset, or lithographic, marking 
process 165 that was used to generate an offset printed/ lithographic image 
166. 



[0069] The image data statistics generation circuit, routine or application 470 is 

activated by the controller 420 to generate one or more data statistics of the 
image data, as discussed above, which are then are analyzed by one or 
more of the circuits, routines or applications 420, 430, 440. 

[0070] The media/image marking process determination circuit, routine or 

application 480 is activated by the controller 420 to determine the type of 
image marking process used to process the image data evaluated or 
analyzed. 

[0071] A fully automated approach for detecting the input image marking process 
based on the spatial statistics of the scanned image has been described. 
Because the spatial statistics of the scanned image are highly correlated 
with the underlying reproduction process, the methods and systems 
according to various exemplary embodiments of the invention allow for a 
reliable classification of the type of the image marking process. It is also well 
understood that any automated approach can be used in a semi-automatic 
fashion to aid a user, either by preferentially guiding user decisions, by 
setting system defaults, by alerting users to discrepancies, or the like. 

[0072] Although the above discussion first selects blocks of pixels to be used for 
image analysis, then creates statistical data indicative of a marking process, 
then creates a dispersion metric for the blocks, then creates a periodicity 
metric, this order may be changed, especially if the input image marking 



processes have some sort of pre-classification. Moreover, because the 
metrics described above have been shown to be sequentially derived, some 
classification decisions may be made earlier than others. It should also be 
noted that a periodicity metric may also be considered to be a noise metric 
because a periodicity metric compares amplitudes and harmonics. 

[0073] While this invention has been described with reference to a color scanner, 
the invention is not limited to such an embodiment. The invention may be 
applied to scanned image data captured at a remote location or to image 
data captured from a hard copy reproduction by a device other than a 
scanner, for example a digital camera. The invention may be practiced on 
any color reproduction device, such as, for example a color photocopier, and 
is also not intended to be limited to the particular colors described above. 

[0074] While this invention has been described in conjunction with specific 
embodiments outlined above, it is evident that many alternatives, 
modifications and variations will be apparent to those skilled in the art. 
Accordingly the preferred embodiments of the invention as set forth above 
are intended to be illustrative and not limiting. Various changes may be 
made without departing from the spirit and scope of the invention as defined 
in the following claims. 



